Object Priors for Classifying and Localizing Unseen Actions
نویسندگان
چکیده
Abstract This work strives for the classification and localization of human actions in videos, without need any labeled video training examples. Where existing relies on transferring global attribute or object information from seen to unseen action we seek classify spatio-temporally localize videos image-based only. We propose three spatial priors, which encode local person detectors along with their relations. On top introduce semantic extend matching through word embeddings simple functions that tackle ambiguity, discrimination, naming. A embedding combines priors. It enables us a new retrieval task retrieves tubes collections based user-specified objects, relations, size. Experimental evaluation five datasets shows importance priors actions. find persons objects have preferred relations benefit localization, while using multiple languages filtering directly improves matching, leading state-of-the-art results both localization.
منابع مشابه
Classifying Unseen Cases with Many
Handling missing attribute values is an important issue for classiier learning, since missing attribute values in either training data or test (unseen) data aaect the prediction accuracy of learned classi-ers. In many real KDD applications, attributes with missing values are very common. This paper studies the robustness of four recently developed committee learning techniques, including Boosti...
متن کاملClassifying Unseen Cases with Many Missing Values
Handling missing attribute values is an important issue for classiier learning, since missing attribute values in either training data or test (unseen) data aaect the prediction accuracy of learned classiiers. In many real KDD applications, attributes with missing values are very common. This paper studies the robust-ness of four recently developed committee learning techniques, including Boost...
متن کاملClassifying and Coding Online Actions
Research on how the Internet is diffusing across the population has broadened from questions about who uses the medium to what people do during their time online. With this change in focus comes a need for more detailed data on people’s online actions. The author provides a method for coding and classifying users’online information-seeking behavior. The author presents an exhaustive list of way...
متن کاملClassifying Facial Actions
The Facial Action Coding System (FACS) [23] is an objective method for quantifying facial movement in terms of component actions. This system is widely used in behavioral investigations of emotion, cognitive processes, and social interaction. The coding is presently performed by highly trained human experts. This paper explores and compares techniques for automatically recognizing facial action...
متن کاملComparing pixel-based and object-based algorithms for classifying land use of arid basins (Case study: Mokhtaran Basin, Iran)
In this research, two techniques of pixel-based and object-based image analysis were investigated and compared for providing land use map in arid basin of Mokhtaran, Birjand. Using Landsat satellite imagery in 2015, the classification of land use was performed with three object-based algorithms of supervised fuzzy-maximum likelihood, maximum likelihood, and K-nearest neighbor. Nine combinations...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Vision
سال: 2021
ISSN: ['0920-5691', '1573-1405']
DOI: https://doi.org/10.1007/s11263-021-01454-y